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METHOD AND APPARATUS FOR GENERATING 
MODEL DATA FROM CAMERA IMAGES 

The present invention relates to an apparatus and method 
5 of operation thereof for generating model data of a model 
in a three-dimensional space from image data 
representative of a set of camera images of an object. 

It is known to create three-dimensional computer models 
10 of real objects based on the input of image data in the 
form of a series of image frames which may be derived 
from a series of photographs taken from different camera 
positions or from a video recording taken from a moving 
camera. 

15 

Having generated a set of model data, a model image is 
displayed and may be compared with camera images of the 
object from which the existing model data has been 
derived . 

20 

A first aspect of the present invention relates to 
refinement of the existing model data by allowing a user 
to identify an additional feature in one of the camera 
images, this feature being absent from the displayed 
2 5 model image, and which the user wishes to include in the 
model by the input of additional model data. 



One method of refining the model in this respect requires 
the user to continue the process of entering matching 
points identified in successive image frames and the 
apparatus to then process the matching point data by re- 
running the model program to incorporate an expanded set 
of data. This process however requires a substantial 
amount of computer processing effort with consequent 
delay. In some instances , the additional feature may 
only be visible in a single frame, making it impossible 
to identify a matching point in a second frame. 

The present invention seeks to provide an improved method 
and apparatus allowing an additional feature to be added 
as a result of user input based on a single frame. 

According to the present invention there is disclosed a 
method of operating an apparatus for generating model 
data representative of a model in a three dimensional 
space from image data representative of a set of camera 
images of an object; 

the apparatus performing the steps of; 

displaying a model image based on an existing set of 
model data; 

displaying one of the camera images of the object 
for selection by a user of an additional feature to be 
represented by additional model data; 

receiving an image point selection signal responsive 
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to user actuation of an input means and identifying co- 
ordinates of an image point in the camera image defining 
the selected additional feature; 

calculating a locus in the three dimensional space 
5 defining positions of possible model points corresponding 
to the image point and consistent with the geometric 
relationship between the object and a camera position 
from which the displayed camera image was taken; 

displaying a position indicator in the model image 
10 at co-ordinates in the model image corresponding to one 
of the possible model points on the locus; 

receiving positioning signals responsive to user 
actuation of the input means and updating the co- 
ordinates of the position indicator such that movement of 
15 the position indicator is constrained to follow a 
trajectory in the model image corresponding to the locus; 

receiving a model point selecting signal responsive 
to user actuation of the input means and determining 
selected co-ordinates of the position indicator to be the 
20 position indicator co-ordinates at the time of receiving 
the model point selecting signal; and 

determining co-ordinates of the additional model 
point in the three dimensional space corresponding to the 
selected co-ordinates of the position indicator. 

25 

In a preferred embodiment, the locus is a straight line 
in the three-dimensional model space, the straight line 
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being displayed in the model image as a visual aid to the 
user in editing the position of the new model point. 
After finalising the position of the new model point , a 
model generating process is initiated to incorporate the 
5 additional model point into the model data and to 
generate surface elements of the model, allowing the new 
model image to be displayed including the surface 
elements for comparison with the camera image. 



10 A second aspect of the present invention relates to the 
manner in which the model data is edited when a new model 
point is added to the existing set of model data, either 
using the above disclosed method or by other methods. 
Incorporation of an additional model point generally 

15 requires the surface elements of the existing model to be 
modified, at least one of the surface elements being 
replaced by a plurality of new elements which include the 
new model point. This aspect of the invention addresses 
the problem of selecting the surface element to be 

20 modified or replaced in a manner which is simple for the 
user to implement. 

According to the second aspect of the present invention 
there is disclosed a method of operating an apparatus for 
25 generating model data defining a model in a three 
dimensional space, the model data comprising co-ordinates 
defining model points and surface elements generated with 
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reference to the model points; the method comprising 
editing an existing set of model data by the steps of; 

adding a new model point to the existing set of 
model data; 

5 projecting the hew model point onto the model and 

identifying a selected one of the surface elements onto 
which the new model point is projected; 

identifying a subset of the model points which 
define the generation of the selected surface element; 
10 adding the new model point to the subset to form an 

edited subset of model points; and 

generating one or more edited surface elements from 
the edited subset of model points to replace the selected 
surface element. 

15 

The identification of the surface element to be replaced 
is thereby automatically implemented by the apparatus , by 
operating a computer program selected by the user. 

20 In a preferred embodiment , the projection of the new 
model point onto the model is processed by defining a 
centre of projection corresponding to one of the camera 
positions from which f rames of the camera image data were 
obtained. An interface allowing the user to select an 

25 appropriate camera position may comprise a display of a 
pictorial representation showing the relative positions 
of the object and the cameras, the camera positions being 
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represented by icons which may be selected by clicking a 
computer mouse or other input device. 



A further embodiment provides an alternative interface in 
5 which thumbnail images of the camera image frames are 
presented to the user, each thumbnail image constituting 
an icon allowing selection using a pointing device such 
as a computer mouse in conjunction with a moveable cursor 
on the display screen. 

10 

A third aspect of the present invention relates to the 
need to enable the user to evaluate the quality of a 
model in order to judge whether further refinement of the 
model data is required and to judge whether any editing 
15 procedure has been correctly effected or requires further 
editing. 

This aspect of the invention seeks to provide the user 
with an interface allowing the user to view a model image 
20 for comparison with a camera image, it being advantageous 
to present the user with compatible views for ease of 
comparison. The selection of the appropriate model image 
for comparison with a specific camera image may be time 
consuming and complex to the user. 

25 

According to the present invention there is disclosed a 
method of operating an apparatus for generating model 



data representative of a three dimensional model of an 
object from input signals representative of a set of 
camera images of the object taken from a plurality of 
camera positions, the method comprising; 
5 displaying a set of icons, each being associated 

with a respective one of the camera images of the object; 

receiving a selection signal responsive to user 
actuation of an input means whereby the selection signal 
identifies a selected one of the icons; 
10 determining a selected camera image from the set of 

camera images corresponding to the selected icon; 

displaying the selected image; 

determining position data representative of a 
selected camera position from which the selected image 
15 was taken; 

generating in accordance with said model a model 
image representative of a view of the model from a 
viewpoint corresponding to the position data; and 

displaying the model image for visual comparison 
20 with the selected image by the user. 



This method therefore allows the user to simply select a 
camera image using a set of icons and provides automatic 
processing using a computer program to generate a model 
25 image representative of a view of the model from a 
viewpoint corresponding to position data determined when 
the user selects a particular icon. 
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The icons may be representations of camera positions 
relative to a representation of the object being modelled 
or alternatively the icons may be thumbnail images of the 
frames of camera image data. 

5 

The user is thereby presented with a computer interface 
allowing correctly comparable model and camera images to 
be rapidly selected for evaluation. The selection 
process may thereby be repeated to view the images from 
10 different viewpoints in order to rapidly gain an overview 
of the quality of the model data as a basis for deciding 
whether further editing is required. 

Embodiments of the above aspects of the present invention 
15 will now be described by example only and with reference 
to the accompanying drawings of which; 

Figure 1 is a schematic representation of components of 
a modular system in which the present invention may be 
20 embodied; 

Figure 2 is a schematic representation of a model window; 

Figure 3 is a schematic representation of a camera image 
25 window in which a displayed camera image includes an 
additional feature which is not represented in the model 
image of Figure 2 ; 
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Figure 4 is a schematic representation of a calculated 
locus in the 3-D model space for a new model point; 



Figure 5 is a schematic representation of a model window 
5 including a new point moved by the user to positions 
constrained by the calculated locus; 

Figure 6 is a schematic representation of a model window 
during user selection of points for connection to the new 
10 model point; 

Figure 7 is a schematic representation of the model 
window in which the displayed model image shows the new 
model point and facets; 

15 

Figure 8 is a schematic representation of the model 
window showing the model image including the added model 
data, viewed from the same direction as the camera image 
of Figure 3 ; 

20 

Figure 9 is a schematic flowchart showing the method 
steps for adding the new model data; 

Figure 10 is a further general illustration of the 
2 5 apparatus including a display screen; 

Figure 11 is a representation of a model window including 
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a display of a line representing the calculated 
trajectory; 

Figure 12 is a schematic representation of the addition 
5 of a new model point to an existing model according to a 
second aspect of the present invention; 

Figure 13A is a schematic representation of a camera 
selection window using camera icons; 

10 

Figure 13B illustrates an alternative camera selection 
window using thumbnail icons; 

Figure 14 is a diagram illustrating the calculation of a 
15 ray intersecting a facet of the model; 

Figure 15 is a diagram illustrating the subdivision of a 
facet to include the added model point; 

20 Figure 16 is a diagram illustrating the display of a new 
model including the added point and new facets; 

Figure 17 is a flowchart illustrating the method 
described with reference to Figures 12 to 16; 

25 

Figure 18 is a flowchart illustrating the step of 
replacing the existing facets with new facets using re- 
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triangulat ion ; 

Figure 19 is a diagram illustrating the identification of 
co-ordinates in a camera image of a feature corresponding 
5 to the added model point; 

Figure 20 is a diagram illustrating the calculation of 
the intersection with the facet of a ray through the 
camera image point and the added model point; 

10 

Figure 21 is a flowchart illustrating the method 
described with reference to Figures 18 to 20; 

Figures 22 to 26 illustrate a third aspect of the present 
15 invention, Figure 22 illustrating schematically camera 
positions in relation to an object to be modelled; 

Figure 23 illustrates a display screen of a computer 
interface allowing viewpoints to be selected by a user 
20 for selecting both camera image and model image; 

Figure 24 is a flowchart illustrating the method of 
implementing the interface of Figure 23; 



25 



Figure 25 illustrates an alternative interface display 
allowing image selection using camera position icons; and 



12 2636201 
Figure 2 6 is a flowchart illustrating the operation of 
the interface of Figure 25. 



Figure 1 schematically shows the components of a modular 
5 system in which the present invention may be embodied. 

These components can be effected as processor-implemented 
instructions , hardware or a combination thereof. 

10 Referring to Figure 1, the components are arranged to 
process data defining images (still or moving) of one or 
more objects in order to generate data defining a three- 
dimensional computer model of the object(s). 

15 The input image data may be received in a variety of 
ways, such as directly from one or more digital cameras, 
via a storage device such as a disk or CD ROM, by 
digitisation of photographs using a scanner, or by 
downloading image data from a database, for example via 

20 a datalink such as the Internet, etc. 

The generated 3D model data may be used to: display an 
image of the object(s) from a desired viewing position; 
control manufacturing equipment to manufacture a model of 
25 the object (s), for example by controlling cutting 
apparatus to cut material to the appropriate dimensions; 
perform processing to recognise the object(s), for 
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example by comparing it to data stored in a database; 
carry out processing to measure the object(s), for 
example by taking absolute measurements to record the 
size of the object(s), or by comparing the model with 
5 models of the object(s) previously generated to determine 
changes therebetween; carry out processing so as to 
control a robot to navigate around the object(s); store 
information in a geographic information system (GIS) or 
other topographic database; or transmit the object data 

10 representing the model to a remote processing device for 
any such processing , either on a storage device or as a 
signal (for example, the data may be transmitted in 
virtual reality modelling language (VRML) format over the 
Internet, enabling it to be processed by a WWW browser); 

15 etc. 

The feature detection and matching module 2 is arranged 
to receive image data recorded by a still camera from 
different positions relative to the object (s) (the 
20 different positions being achieved by moving the camera 
and/or the object(s)). The received data is then 
processed in order to match features within the different 
images (that is, to identify points in the images which 
correspond to the same physical point on the object(s)). 

25 



The feature detection and tracking module 4 is arranged 
to receive image data recorded by a video camera as the 



relative positions of the camera and object(s) are 
changed (by moving the video camera and/or the 
object(s)). As in the feature detection and matching 
module 2, the feature detection and tracking module 4 
5 detects features, such as corners, in the images. 
However, the feature detection and tracking module 4 then 
tracks the detected features between frames of image data 
in order to determine the positions of the features in 
other images. 

10 

The camera position calculation module 6 is arranged to 
use the features matched across images by the feature 
detection and matching module 2 or the feature detection 
and tracking module 4 to calculate the transformation 
15 between the camera positions at which the images were 
recorded and hence determine the orientation and position 
of the camera focal plane when each image was recorded. 

The feature detection and matching module 2 and the 
2 0 camera position calculation module 6 may be arranged to 
perform processing in an iterative manner. That is, 
using camera positions and orientations calculated by the 
camera position calculation module 6, the feature 
detection and matching module 2 may detect and match 
2 5 further features in the images using epipolar geometry in 
a conventional manner, and the further matched features 
may then be used by the camera position calculation 
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module 6 to recalculate the camera positions and 
orientations . 

If the positions at which the images were recorded are 
5 already known, then/ as indicated by arrow 8 in Figure 1, 
the image data need not be processed by the feature 
detection and matching module 2, the feature detection 
and tracking module 4 f or the camera position calculation 
module 6. For example, the images may be recorded by 
10 mounting a number of cameras on a calibrated rig arranged 
to hold the cameras in known positions relative to the 
object ( s ) . 

Alternatively, it is possible to determine the positions 
15 of a plurality of cameras relative to the object(s) by 
adding calibration markers to the object (s) and 
calculating the positions of the cameras from the 
positions of the calibration markers in images recorded 
by the cameras. The calibration markers may comprise 
20 patterns of light projected onto the object(s). Camera 
calibration module 10 is therefore provided to receive 
image data from a plurality of cameras at fixed positions 
showing the object (s) together with calibration markers, 
and to process the data to determine the positions of the 
25 cameras. A preferred method of calculating the positions 
of the cameras (and also internal parameters of each 
camera, such as the focal length etc) is described in 
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"Calibrating and 3D Modelling with a Multi-Camera System" 
by Wiles and Davison in 1999 IEEE Workshop on Multi-View 
Modelling and Analysis of Visual Scenes, ISBN 0769501109. 

5 The 3D object surface generation module 12 is arranged to 
receive image data showing the object(s) and data 
defining the positions at which the images were recorded , 
and to process the data to generate a 3D computer model 
representing the actual surface(s) of the object(s), such 
10 as a polygon mesh model. 



The texture data generation module 14 is arranged to 
generate texture data for rendering onto the surface 
model produced by the 3D object surface generation 
15 module 12. The texture data is generated from the input 
image data showing the object (s). 

Techniques that can be used to perform the processing in 
the modules shown in Figure 1 are described in EP-A- 
20 0898245, EP-A-090 1 105 , pending US applications 09/129077, 
09/129079 and 09/129080, the full contents of which are 
incorporated herein by cross-reference, and also Annex A. 

The present invention may be embodied in particular as 
2 5 part of the feature detection and matching module 2 
(although it has applicability in other applications, as 
will be described later) . 
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Figure 10 illustrates generally the apparatus 100 of the 
present embodiment, comprising a processor 101, display 
monitor 102, and input devices including a computer mouse 
103 and keyboard 104. The mouse 103 enables signals such 
5 as an image point selection signal 112 (described below) 
to be input to the processor. 

A disc drive 105 also receives a floppy disc 106 carrying 
program code and/or image data for use by the processor 
10 101 in implementing the method steps of the present 
invention. 

The display monitor 102 has a display screen 107 which, 
in the present mode of operation of the program, displays 
15 a model window 108 and a camera image window 109. 

The processor 101 is connected to a modem 110 enabling 
program code or image data to be alternatively downloaded 
via the internet as an electronic signal 111. 

20 

The method steps according to a first aspect of the 
present embodiment are illustrated in Figure 9 in which 
steps performed by the user and by the apparatus are 
separated by a broken line 90 representing the interface 
25 provided by the display screen 107 and input devices 
103, 104. 



The method begins from a starting point at which the 
apparatus has already acquired a set of existing model 
data derived for example using the components in Figure 
1 to process input image data in the form of a series of 
5 image frames obtained from a camera at respective 
different camera positions. The model data includes a 
set of model points and surface elements and estimates of 
the camera positions in the form of model co-ordinates 
for camera centres and look-directions derived for 
10 example by operation of camera position calculation 
module 6 to calculate camera positions based on the image 
data. 

At step 91, the apparatus displays in the display screen 
15 107 a model image 20 in the model window 108 as 
illustrated in Figure 2. Also displayed for side by side 
comparison is a camera image 3 0 in the camera image 
window 109 as illustrated in Figure 3. 

20 The model image 2 0 of Figure 2 is rendered using existing 
model data which the user wishes to update in order to 
add additional model data representing an additional 
feature 31 which is visible in the camera image of Figure 
3 but which has no equivalent in the model image 20 of 

25 Figure 2. The model image 20 and camera image 30 as 
shown in Figures 2 and 3 are generated as views from 
substantially the same viewing direction. 
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At step 92, the user views the model image 20 and the 
camera image 30 and selects an image point 32 in the 
camera image 30 by using the computer mouse 103 to align 
a cursor 33 with the selected additional feature 31 and 
5 then clicking the mouse to generate an image point 
selection signal at step 93. 

At step 94 , the apparatus receives the image point 
selection signal and processes the signal to identify co- 
10 ordinates of the image point in the camera image 30. 

Since the camera image 30 is a two-dimensional projection 
of the object from which the model is derived, the two- 
dimensional co-ordinates obtained by user selection of 

15 the image point 32 do not specify uniquely a position in 
three dimensions at which the new model point is to be 
added. At step 95, the apparatus calculates the locus in 
three dimensions of the positions of possible model 
points corresponding to the selected image point 32 which 

20 are consistent with the geometric relationship between 
the object and the camera position from which the 
displayed camera image 30 was taken. This is illustrated 
in Figure 4 in which the model is viewed from a different 
viewpoint from that of Figure 2 and in which the locus is 

25 a straight line extending in the three dimensional space 
of the model from the model co-ordinates of the camera 
centre 40 and through the co-ordinates of the image point 



20 
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32 in the camera image plane 41. 

An exemplary model point 42 lying on the locus 43 is 
illustrated in Figure 4 at one of the possible positions 
5 at which the new model point could be added. 

At step 96 , the apparatus displays in the model window a 
new model image 21 as shown in Figure 5 in which a 
position indicator 50 lies on the locus 43 and is movable 

10 in response to movement of the computer mouse by the user 
so as to be constrained to follow a trajectory 51 
corresponding to the locus when projected into the plane 
of the model image 21. The new model image 21 of Figure 
5 is generated as a view of the model from a different 

15 viewpoint selected to clearly display the locus. Such 
different viewpoints are selected by the user by 
temporarily selecting a different mode of operation from 
a menu of available modes, the viewpoint selecting mode 
providing rotation of the model image in latitude and 

20 longitude in response to sideways and forward/reverse 
movement of the mouse respectively. 

At step 97 , the user views the model image 21 and the 
position indicator 50 and decides upon an appropriate 
2 5 position of the position indicator 5 0 to represent the 
additional feature 31. At step 98, the user actuates the 
mouse to move the position indicator 50 to the selected 



position , the apparatus updating the position of the 
position indicator appropriately at step 99 , and at step 
910 the user clicks the mouse, thereby selecting the 
desired position to set the position of the new model 
5 point. At step 911, the apparatus receives a selection 
input signal corresponding to the mouse click and freezes 
the position at which the position indicator 50 is 
displayed in the model image window. At step 912, the 
apparatus determines the three-dimensional co-ordinates 
10 corresponding to the selected position of the additional 
model point, the co-ordinates being uniquely identified 
in three-dimensions from the known geometry of the locus 
and the selected position in the two-dimensional 
projection forming the model image 21 of Figure 5. 

15 

At step 913, the apparatus adds the new model point to 
the existing model data and at step 914 displays the new 
model point 64 in the model window 10B together with 
existing model points, superimposed on the model image 20 
20 as shown in Figure 6. 

At step 915, the user views the model image and the new 
model point and selects a set of existing model points 
61, 62 and 63 for combining with the new model point 64 
25 to form a new subset of points to be used in the 
generation of surface elements of the model. The 
apparatus then generates the additional surface elements 



shown as elements 70 and 71 in Figure 7. Texture data 
may then be rendered onto the resulting surface model 
using a texture data generation module 14 as described 
above with reference to Figure 1. 

5 

Figure 8 illustrates the model image incorporating the 
added model data when viewed from the same direction as 
the original camera image of Figure 3 . In the model 
image of Figure 8, the additional feature 31 of the 
10 camera image 30 is represented by added model feature 80. 

The user may decide that the added model feature 80 does 
not adequately represent the additional feature 31 and, 
if so, may select an editing mode in which the position 

15 of the position indicator 50 may be adjusted and the 
resulting facetted model reviewed until the added model 
feature is judged to be correct, this further step 
requiring the input of further positioning signals and 
model point selecting signals responsive to user 

20 actuation of the mouse. 

In an alternative embodiment illustrated in Figure 11, 
the step 96 of displaying in the model window 108 the new 
model image 21 together with the indicator 50 may also 
25 include displaying a line 120 indicating the path of the 
trajectory 50. 
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Alternative embodiments are envisaged in which for 
example non-linear locus calculation is effected, for 
example to take account of image distortion known to be 
present in the camera optics. Alternative means may be 
5 utilised for the input of data in place of a computer 
mouse, alternative forms of pointing device such as touch 
screen and touch pad devices being usable, or 
alternatively conventional keyboard devices may be used 
to input co-ordinates. 

10 

In a further alternative embodiment, the step 915 in 
which the user selects existing model points for surface 
generation may be replaced by a step carried out by the 
apparatus to automatically select existing model points 
15 to be used in combination with the new model point as a 
basis for re-facetting the model. 

A preferred method of performing such automatic facetting 
will be described below. 

20 

The method of the present invention can be implemented by 
a computer program operating on the computer apparatus 
100, the program comprising processor implementable 
instructions for controlling the processor 101. The 
25 program may be stored in a storage medium such as floppy 
disk 106. An aspect of the present invention thus 
provides a storage medium storing processor implementable 
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instructions for carrying out the above described method - 

Further, the computer program may be obtained in 
electronic form for example by downloading the program 
5 code in the form of a signal 111 over a network such as 
the internet via the modem 38. 

Alternative embodiments of the present invention are 
envisaged in which for example the above described method 

10 and apparatus are used to process camera images obtained 
by selecting frames from a video camera recording , the 
frames representing different views of the object. The 
displayed images may additionally be modified to include 
dimensional information as a guide to the user in 

15 determining the optimum position of the new model point. 

A further aspect of the present embodiment will now be 
described , relating to the automatic re-facetting of the 
model when a new model point is added to a set of 
20 existing model points. Corresponding reference numerals 
to those of preceding figures will be used where 
appropriate for corresponding elements. 

Figure 12 illustrates a new model point 64 which has been 
25 added to the data used to derive a model image 20 
displayed in a model window 108 in a display screen 107 
of a processor controlled apparatus 10 0 of the type 



25 2636201 
illustrated in Figure 10 and functioning as a system in 
the manner described above with reference to Figure 1. 



The addition of the new model point 64 may be the result 
5 of a process using selection of a camera image point and 
generating a locus in the model space as described above 
with reference to Figures 2 to 11 or may be the result of 
a different process, such as for example the input via a 
keyboard of numerals representing co-ordinates in the 
10 three-dimensional model space. 

In Figure 12 , the model image 20 is representative of an 
irregularly shaped object represented schematically by a 
multi-facetted image in which the surface is comprised of 

15 a large number of triangular facets. In practice, the 
number of facets is likely to be greatly increased beyond 
the relatively small number illustrated in Figure 12 so 
that Figure 12 should therefore be regarded as schematic 
for the purpose of simplicity of representation in this 

20 respect. 

The method steps required to implement the method are 
illustrated in the flowchart of Figure 17 in which steps 
performed by the user are illustrated in the left-hand 
25 portion of the flowchart, steps implemented by the 
apparatus are shown in the right-hand portion of the 
flowchart and an interface between the user and the 
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apparatus is represented as a broken line 90. In 
practice , the interface is comprised of the display 
screen 107 and the computer mouse 10 6 allowing the input 
of pointing signals in conjunction with the display of a 
5 cursor 33 on the display screen 107. 



The following method steps illustrated in Figure 17 will 
be described with reference to Figures 12 to 16. At step 
170 , the user selects via mode icons 230 a mode of 
10 operation of the apparatus for choosing a view of the 
model and the apparatus responds by displaying the model 
image 20 in the model image window 108. The user 
actuates the mouse 103 to orient the model view to a 
position which is judged to be appropriate. 

15 

At step 17 1, the user selects a mode of operation for the 
addition of model points and the apparatus responds by 
displaying a prompt for the input of the model point 
information . The user inputs co-ordinates of the added 

20 model point and, at step 172, the apparatus displays the 
new model point in the model image window 108 as 
illustrated in Figure 12. The apparatus also displays on 
the display screen 107 a camera selection window 130 as 
illustrated in Figure 13A in which the camera positions 

2 5 relative to the object represented by the model image are 
graphically represented in a manner which enables the 
user to choose one of the cameras as being appropriately 
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located for the purpose of defining a centre of 
projection to allow the new model point 64 to be 
projected onto the existing model. The user may for 
example already have knowledge of the object being 
5 modelled and a general indication of the required camera 
view. 

In the camera selection window 130, the cameras are 
represented at their positions relative to a 
10 representation of the object 131 by respective camera 
icons 132 such that the user is able to select one of the 
cameras by use of the mouse , the user aligning the cursor 
33 onto a selected one of the camera icons and clicking 
the mouse 103 to effect selection. 

15 

At step 174 , the apparatus receives the camera selecting 
signal and determines the position of the camera centre 
147 in the three-dimensional co-ordinate system of the 
model . 

20 

At step 175, the apparatus calculates the manner in which 
the new model point 64 is projected onto the surface of 
the model by calculating a ray in the model space through 
the position of the camera centre and the co-ordinates of 
25 the new model point. As shown in Figure 14, a ray 140 
defined in the above manner intersects the surface of the 
model at a point of intersection 141 which lies within a 



28 2636201 

facet 142 defined by apices 143, 144 and 145 and also 
intersects a second facet 146 on exiting the model 
surface . 



5 At step 176 , the apparatus replaces the existing facet 
142 with new facets 150 , 151 and 152 as illustrated in 
Figure 15, each of which includes the new model point 64 
as a respective apex. At step 177 , the apparatus 
displays the new model image including the; added point 64 
10 and the new facets 150, 151 and 152 as illustrated in 
Figure 16 in which the new facets are highlighted by 
beihg cross-hatched (facet 152 is hidden from view) . 

Step 176 of replacing the existing facet with new facets 
15 is illustrated in greater detail in the flowchart of 
Figure 18. At step 180, the apparatus determines whether 
the ray 140 intersect one of the model facets. If no 
intersection occurs, the apparatus displays a prompt to 
the user to select a model facet at step 181 and at step 
20 182 the user responds by selecting a facet to be 
replaced, selection being carried out using the mouse and 
cursor. At step 183, the apparatus determines the set of 
co-ordinates upon which the selected facet is based and, 
at step 184, adds the new model point to this set of co- 
25 ordinates. In this example, since the facet being 
replaced is triangular, the set of co-ordinates on which 
the facet is based consists of three model points. When 
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the new model point is added, there are four model points 
as a basis for re-triangulation. At step 185, the 
apparatus performs re-triangulation to define three 
triangular facets which connect the set of four points to 
5 form part of the surface of the model as illustrated in 
Figure 15. 

If at step 180, the apparatus determines that the ray 
does in fact intersect a model facet 142 as shown in 

10 Figure 14, the point of intersection 141 is determined, 
thereby defining the facet 142 which is intersected by 
the ray, and the set of co-ordinates of the intersected 
facet are then used in combination with the new model 
point at step 184 to define the set of new co-ordinates. 

15 If, as in the case of Figure 14, more than one facet is 
intersected by the ray 140, the apparatus determines at 
step 185 which of the facets is closest to the new model 
point 64 as a subject for re-triangulation. In the 
example of Figure 14, the facet 142 is therefore selected 

20 in preference to facet 146 since it is closer to the new 
model point 64. 

Figure 13B illustrates an alternative method of selecting 
the camera position by using a camera selection window 
25 130 which includes a series of thumbnail icons 133, each 
thumbnail icon comprising a thumbnail image derived from 
the image data obtained from a respective camera 
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position. The user may thereby select from the displayed 
thumbnail images the appropriate camera position for 
viewing the required aspect of the object represented by 
the model image and by clicking the mouse 103 when cursor 
5 33 is on the thumbnail icon 133 , generates a pointing 
signal 112 received at step 174 of Figure 17 by the 
apparatus , thereby enabling the required camera position 
to be determined as a centre of projection. 



10 In the above described example , the centre of projection 
for projecting the new model point onto the surface of 
the model is defined as being the centre of the camera. 
The centre of projection may alternatively be defined in 
terms of the point in the image plane of the camera 

15 corresponding to the location of the image point 
corresponding to the new model point. For example, in 
Figure 19, a camera image 30 is displayed in a camera 
image window 190 to allow the user to select a camera 
image point 191 determined by the user to correspond to 

20 the new model point 64. As illustrated in the flowchart 
of Figure 21, the co-ordinates" of "the camera image point 
are input at step 210 to enable the apparatus to 
calculate at step 211 the ray in the model space through 
the co-ordinates of the added model point and camera 

25 image point as illustrated in Figure 2 0 where the 
position 200 of the camera image point in the camera 
plane 201 is used to determine the trajectory of the ray 
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140. 

Alternative devices may be used in place of the computer 
mouse 103 for the input of selection signals, including 
5 for example any conventional pointing device such as a 
touch screen or touch pad device. Alternatively , a 
keyboard 104 may be used for the input of commands or co- 
ordinates. 

10 In the method of Figure 17 , the user may choose to change 
from one mode to another at any time by selecting one of 
the mode icons 230. 

The method of the above aspect of the present invention 
15 described with reference to Figures 1, 10, and 12 to 20 
can be implemented by a computer program operating on the 
computer apparatus 100, the program comprising processor 
implementable instructions for controlling the processor 
101. The program may be stored in a storage medium such 
20 as floppy disk 106. An aspect of the present invention 
thus provides a storage medium storing processor 
implementable instructions for carrying out the above 
described method. 

2 5 Further, the computer program may be obtained in 
electronic form for example by downloading the program 
code as a signal 111 over a network such as the internet 
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via the modem 110, 

A further aspect of the present embodiment will now be 
described using corresponding reference numerals to those 
5 of preceding figures where appropriate for corresponding 
elements. This aspect of the embodiment relates to the 
provision of a method and apparatus enabling an interface 
to allow a user to evaluate the quality of a model of the 
type discussed above, and in particular of the type 
10 discussed with reference to Figure 1 using the apparatus 
described above with reference to Figure 10. 

As previously discussed, a user may adopt one of a number 
of techniques for refining and editing model data in 

15 order to achieve an improved model image. In order to 
evaluate the quality of the model image, this aspect of 
the embodiment allows views of the model image and camera 
image to be presented in respective model image windows 
and camera image windows on the display screen and for 

20 the respective images to be presented such that both the 
camera image and model image represent views of the 
object from substantially the same viewpoint and in 
respect of which substantially the same image settings 
such as magnification, field of view, etc, are provided 

2 5 (these latter parameters are referred to below as "camera 
intrinsics" ) . 
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Figure 22 illustrates the relationship between a physical 
object 220 which is the subject of the modelling exercise 
and a set of camera positions L(i), relative to the 
object 220 , from which a set of frames of image data are 
5 obtained, a corresponding camera image I(i) being 
obtained. The camera images may be obtained by moving a 
single camera successively into the camera positions 
L(i), by having a set of different cameras or by moving 
the object relative to a stationary camera, for example. 

10 

Having obtained model data allowing model images to be 
displayed, the user wishes to evaluate the model by 
displaying side by side a camera image and a model image. 
In Figure 22, camera position L(3) is of particular 
15 interest to the user. 

Using the apparatus of Figure 10, the user operates the 
apparatus to achieve this result using the method steps 
illustrated in the flowchart of Figure 24 which will be 
20 illustrated below with reference to Figure 23. 

At step 240, the user selects the required mode of 
operation for displaying camera and model images for the 
purpose of evaluation, mode selection being achieved 
25 using the interface provided by the display screen 107, 
the cursor 33 and the mouse 103 to select one of the mode 
icons 230 located in a peripheral region of the display 
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screen as shown in Figure 23. 

At step 2 41, the apparatus generates camera image data 
for each of the frames of image data, using the thumbnail 
5 image format, and displays the thumbnail images as icons 
231 within an icon window 232 of the display screen 107. 

The icons 231 are displayed in a sequence as calculated 
by camera position calculation module 6 which corresponds 

10 to the spatial relationship of the positions L(i) as 
shown in Figure 22, so that the sequence L(i), i = 1 to 
n progressing from left to right is maintained in the 
layout of the icons on the display screen 107 such that 
images I(i)/ i = 1 to n, are positioned from left to 

15 right according to the value of i. 

For simplicity of representation, the images shown in 
Figure 23 are those of a regular polyhedron in which an 
x is drawn on one of the faces so that the apparent 
2 0 position of the x in each of the displayed thumbnail 
images corresponds to the view which would be obtained 
from the camera positions L(i). 

At step 242 the user views the icons and at step 243 the 
25 user selects one of the icons as being of particular 
relevance for the purpose of evaluation of the images. 
The user selects the icon as indicated in Figure 23 by 
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the cursor 33 overlaying the third image , i = 3, 
corresponding to selection of the camera position L(3) of 
Figure 22. 

5 At step 244, the apparatus receives the icon selection 
input and at step 245, the apparatus identifies the 
selected camera image for display in a camera image 
window 109. At step 246, the apparatus determines the 
position data for the selected camera by accessing data 

10 stored with the camera image data and at step 247 
calculates the model image data using the selected 
position data to define the viewpoint for the model. In 
calculating the model image data, the apparatus also uses 
camera intrinsic parameters stored with the camera image 

15 data. The intrinsic parameters of the camera comprise 
the focal length, the pixel aspect ratio, the first order 
radial distortion coefficient, the skew angle (between 
the axes of the pixel grid) and the principal point (at 
which the camera optical axis intersects the viewing 

20 plane) . 

At step 248, the apparatus displays a model image 20 in 
the model image window 108 and the camera image 30 in a 
camera image window 109, thereby allowing the user to 
25 view and compare the selected camera image and the model 
image as calculated from a corresponding viewpoint. 



In Figure 23 , the icons 231 are linked in series by links 
233. If necessary, a large number of such icons may be 
displayed in an array comprising a number of rows, 
maintaining the links between successive icons in order 
5 to visually indicate the continuity of the sequence (i.e. 
the direction of increasing i). The use of such links 
therefore assists in providing the user with an 
indication of where the most appropriate image is to be 
selected. 

10 

After viewing the images for a selected viewpoint, the 
user may then choose to view camera and model images for 
different viewpoints by selecting different icons, 
repeating step 243 of Figure 24, and resulting in the 
15 apparatus repeating steps 244 to 248 to enable the 
further views to be seen. 



If the user then decides that the model data requires 
editing, the user may then select a different mode of 
20 operation by selecting the appropriate mode icon 230 for 
further operation of the apparatus. 

An alternative embodiment will now be described with 
reference to Figure 25 and the flowchart of Figure 26. 
25 Referring to Figure 26, at step 260 the user selects a 
required mode of operation by selecting the appropriate 
mode icon 230 of Figure 25. The apparatus responds by 
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genera-ting and displaying icons 250 in a camera position 
window 251. 

Within the camera position window 251, a display 
5 generated by the apparatus at step 261 comprises a 
representation 252 of the object based upon the model 
data together with representations of cameras at 
positions L(i), i = 1 to n, such that the relative 
positions of the cameras and the representation 252 
10 correspond to the calculated camera positions developed 
by the camera position calculation module 6 of Figure 1 . 
The representation 252 is thereby placed at the origin of 
the co-ordinate system of the model and the icons 250 
located in effect at the calculated camera positions. 

15 

This representation of the relative positions of the 
cameras and object allows the user to easily select a 
viewing point for the camera and model images to be 
displayed. In order to select a particular viewpoint, 

20 the user at step 2 62 views the icons 250 within the 
window 251 and at step 26 3 selects one of the icons at 
the desired camera position. The apparatus responds at 
step 265 by identifying the camera image data 
corresponding to the selected camera position. At step 

25 266, the apparatus then proceeds to calculate the model 
image data using the selected position data as a 
viewpoint and using camera intrinsic parameters stored in 



conjunction with the 
265. 
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data identified in step 



At step 267, the apparatus then displays the model image 
5 in model image window 108 and the camera image 30 in 
camera image window 109 to be viewed by the user at step 
2 68. The user is then able to evaluate the quality of 
the image by comparison between the images. 

10 In each of the display interfaces of Figures 23 and 25, 
the camera image window 109 and the model image window 
108 may be moved relative to one another using a drag and 
drop method by means of actuating the mouse. Similarly, 
the icon windows 232 and 251 may be moved relative to the 

15 image windows 108 and 109, thereby allowing the user to 
arrange the windows for maximum ease of selection and 
comparison. 

The method of the present invention can be implemented by 
2 0 a computer program operating on the computer apparatus 
100, the program comprising processor implementable 
instructions for controlling the processor 101. The 
program may be stored in a storage medium such as floppy 
disk 106. An aspect of the present invention thus 
2 5 provides a storage medium storing processor implementable 
instructions for carrying out the above described method. 
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Further, the computer program may be obtained in 
electronic form for example by downloading the program 
code as a signal 111 over a network such as the internet 
via the modem 110. 
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ANNEX A 

1. CORNER DETECTION 
5 1.1 Summary 

This process described below calculates corner points , to 
sub-pixel accuracy, from a single grey scale or colour 
image. It does this by first detecting edge boundaries in 
10 the image and then choosing corner points to be points 
where a strong edge changes direction rapidly. The 
method is based on the facet model of corner detection , 
described in Haralick and Shapiro 1 . 

15 1.2 Algorithm 

The algorithm has four stages: 

(1) Create grey scale image (if necessary); 

2 0 (2) Calculate edge strengths and directions; 

(3) Calculate edge boundaries; 

(4) Calculate corner points. 

1.2.1 Create grey scale image 

25 

The corner detection method works on grey scale images. 
For colour images , the colour values are first converted 
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to floating point grey scale values using the formula: 

grey_scale = (0.3 x red) +(0.59 x green) +(0.11 x blue) 

. • . • A* - 1 

5 

This is .the standard definition of brightness as defined 
by NTSC and described in Foley and van Dam 11 . 

10 1.2.2 Calculate edge strengths and directions 

The edge strengths and directions are calculated using 
the 7x7 integrated directional derivative gradient 
operator discussed in section 8.9 of Haralick and 
15 Shapiro 1 . 

The row and column forms of the derivative operator are 
both applied to each pixel in the grey scale image. The 
results are combined in the standard way to calculate the 
20 edge strength and edge direction at each pixel. 

The output of this part of the algorithm is a complete 
derivative image . 



25 



1.2.3 Calculate edge boundaries 
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The edge boundaries are calculated by using a zero 
crossing edge detection method based on a set of 5x5 
kernels describing a bivariate cubic fit to the 
neighbourhood of each pixel. 

5 

The edge boundary detection method places an edge at all 
pixels which are close to a negatively sloped zero 
crossing of the second directional derivative taken in 
the direction of the gradient, where the derivatives are 
10 defined using the bivariate cubic fit to the grey level 
surface. The subpixel location of the zero crossing is 
also stored along with the pixel location. 

The method of edge boundary detection is described in 
15 more detail in section 8.8.4 of Haralick and Shapiro 1 . 

1.2.4 Calculate corner points 

The corner points are calculated using a method which 
20 uses the edge boundaries calculated in the previous 
step. 
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Corners are associated with two conditions: 

(1) the occurrence of an edge boundary; and 

5 (2) significant changes in edge direction. 

Each of the pixels on the edge boundary is tested for 
"cornerness" by considering two points equidistant to it 
along the tangent direction. If the change in the edge 
10 direction is greater than a given threshold then the 
point is labelled as a corner. This step is described in 
section 8.10.1 of Haralick and Shapiro 1 . 

Finally the corners are sorted on the product of the edge 
15 strength magnitude and the change of edge direction. The 
top 200 corners which are separated by at least 5 pixels 
are output. 

2. FEATURE TRACKING 

20 

2 . 1 Summary 
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This process described below tracks feature points 
(typically corners) across a sequence of grey scale or 
colour images . 

5 The tracking method uses a constant image velocity Kalman 
filter to predict the motion of the corners, and a 
correlation based matcher to make the measurements of 
corner correspondences . 

10 The method assumes that the motion of corners is smooth 
enough across the sequence of input images that a 
constant velocity Kalman filter is useful, and that 
corner measurements and motion can be modelled by 
gaussians . 

15 

2 . 2 Algorithm 

1) Input corners from an image. 
2 0 2) Predict forward using Kalman filter. 



3) If the position uncertainty of the predicted corner 
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is greater than a threshold, A, as measured by the 
state positional variance, drop the corner from the 
list of currently tracked corners. 

5 4) Input a new image from the sequence. 

5 ) For each of the currently tracked corners : 

a) search a window in the new image for pixels 
which match the corner; 
10 b) update the corresponding Kalman filter, using 

any new observations (i.e. matches). 

6 ) Input the corners from the new image as new points 
to be tracked (first, filtering them to remove any 

15 which are too close to existing tracked points) . 

7) Go back to (2) 



20 



2.2.1 Prediction 

This uses the following standard Kalman filter equations 
for prediction, assuming a constant velocity and random 
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uniform gaussian acceleration model for the dynamics: 



X = e , X ... .A-2 

n+1 n + l,n n 



K = e . K e T +1 +Q n A-3 

where x is the 4D state of the system, (defined by the 
5 position and velocity vector of the corner), K is the 
state covariance matrix, 0 is the transition matrix, and 
Q is the process covariance matrix. 

In this model, the transition matrix and process 
10 covariance matrix are constant and have the following 
values : 



n*l,n [0 I) 



0 0 

0 



. A-5 



15 



2-2.2 Searching and matching 
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This uses the positional uncertainty (given by the top 
two diagonal elements of the state covariance matrix, K) 
to define a region in which to search for new 
measurements (i.e. a range gate ) . 

5 

The range gate is a rectangular region of dimensions: 
Ax = Jk^, Ay = jKr 2 A-6 

10 

The correlation score between a window around the 
previously measured corner and each of the pixels in the 
range gate is calculated. 

15 The two top correlation scores are kept. 

If the top correlation score is larger than a threshold, 
C 0 , and the difference between the two top correlation 
scores is larger than a threshold AC, then the pixel with 
20 the top correlation score is kept as the latest 
measurement . 



2.2.3 Update 
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The measurement is used to update the Kalman filter in 

the standard way: 

G = KH T (HKH T +R) _1 A ~ 7 

x^x+G(x-Hx) • • • -A-8 
Kr+{I-GH)K A ~ 9 

where G is the Kalman gain, H is the measurement matrix, 
and R is the measurement covariance matrix. 

In this implementation, the measurement matrix and 
measurement covariance matrix are both constant, being 
given by: 

H = (10) A-10 

R ~ o 2 I A-ll 

2.2.4 Parameters 



2 0 The parameters of the algorithm are: 



10 



15 



Initial conditions: x 0 and K 0 . 
Process velocity variance: a v 2 . 
Measurement variance : a 2 . 
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Position 



uncertainty threshold for loss 



of 



track: A. 



Covariance threshold: C 0 . 



Matching ambiguity threshold: AC, 



5 



For the initial conditions, the position of the first 
corner measurement and zero velocity are used, with an 
initial covariance matrix of the form: 



o 0 2 is set to a 0 2 = 2 0 0 (pixels/frame) 2 . 

The algorithm's behaviour over a long sequence is anyway 
15 not too dependent on the initial conditions* 

The process velocity variance is set to the fixed value 
of 50 (pixels/frame) 2 . The process velocity variance 
would have to be increased above this for a hand-held 
20 sequence. In fact it is straightforward to obtain a 
reasonable value for the process velocity variance 
adaptively . 



0 



0 




A-12 



10 
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The measurement: variance is obtained from the following 
model: 

a 2 = (rK+a) A- 13 

where K = ^(K 11 K 22 ) is a measure of the positional 
5 uncertainty , r is a parameter related to the likelihood 
of obtaining an outlier, and a is a parameter related to 
the measurement uncertainty of inliers . M r" and "a" are 
set to r=0 . 1 and a=1.0. 

10 This model takes into account, in a heuristic way, the 
fact that it is more likely that an outlier will be 
obtained if the range gate is large. 

The measurement variance (in fact the full measurement 
15 covariance matrix R) could also be obtained from the 
behaviour of the auto-correlation in the neighbourhood of 
the measurement. However this would not take into 
account the likelihood of obtaining an outlier. 

2 0 The remaining parameters are set to the values: A=4 00 
pixels 2 , C 0 =0.9 and AC=0.001. 
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3 . 3D SURFACE GENERATION 
3.1 Architecture 

5 

In the method described below , it is assumed that the 
object can be segmented from the background in a set of 
images completely surrounding the object. Although this 
restricts the generality of the method , this constraint 
10 can often be arranged in practice, particularly for small 
objects. 

The method consists of five processes, which are run 
consecutively: 

15 

- First, for all the images in which the camera 
positions and orientations have been calculated, 
the object is segmented from the background, using 
colour information. This produces a set of binary 
2 0 images, where the pixels are marked as being either 

object or background . 
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- The segmentations are used, together with the 
camera positions and orientations, to generate a 
voxel carving, consisting of a 3D grid of voxels 
enclosing the object. Each of the voxels is marked 

5 as being either object or empty space. 

- The voxel carving is turned into a 3D surface 
triangulation, using a standard triangulation 
algorithm (marching cubes). 

10 

The number of triangles is reduced substantially by 
passing the triangulation through a decimation 
process . 

15 - Finally the triangulation is textured, using 
appropriate parts of the original images to provide 
the texturing on the triangles. 

20 

3 . 2 Segmentation 
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The aim of this process is to segment an object (in front 
of a reasonably homogeneous coloured background) in an 
image using colour information. The resulting binary 
image is used in voxel carving. 

5 

Two alternative methods are used: 



Method Is input a single RGB colour value 
representing the background colour - each RGB pixel 
10 in the image is examined and if the Euclidean 

distance to the background colour (in RGB space) is 
less than a specified threshold the pixel is 
labelled as background (BLACK). 



15 Method 2: input a "blue" image containing a 

representative region of the background. 



The algorithm has two stages: 



20 (1) Build a hash table of quantised background colours 



(2) Use the table to segment each image. 
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Step 1) Build hash table 

Go through each RGB pixel, p, in the "blue" background 
image. 

5 

Set q to be a quantised version of p. Explicitly: 

q = (p+t/2) / t A " 14 

where t is a threshold determining how near RGB values 
need to be to background colours to be labelled as 
1 0 background . 

The quantisation step has two effects: 

1) reducing the number of RGB pixel values, thus 
15 increasing the efficiency of hashing; 

2) defining the threshold for how close a RGB pixel 
has to be to a background colour pixel to be 
labelled as background. 

20 

q is now added to a hash table (if not already in the 
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table) using the (integer) hashing function 

h(q) = (q_red & 7) 6+ {q_green & 7) *2"3+ {qjzlue & 7) 

A-15 

5 

That is, the 3 least significant bits of each colour 
field are used. This function is chosen to try and 
spread out the data into the available bins. Ideally 
each bin in the hash table has a small number of colour 
10 entries. Each quantised colour RGB triple is only added 
once to the table (the frequency of a value is 
irrelevant) . 

Step 2 ) Segment each image 

15 

Go through each RGB pixel, v, in each image. 

Set w to be the quantised version of v as before. 

20 To decide whether w is in the hash table, explicitly look 
at all the entries in the bin with index h(w) and see if 
any of them are the same as w. If yes, then v is a 
background pixel - set the corresponding pixel in the 
output image to BLACK. If no then v is a foreground 



56 2636201 
pixel - set the corresponding pixel in the output image 
to WHITE 



Post Processing: For both methods a post process is 
5 performed to fill small holes and remove small isolated 
regions . 

A median filter is used with a circular window. (A 
circular window is chosen to avoid biasing the result in 
10 the x or y directions). 

Build a circular mask of radius r. Explicitly store the 
start and end values for each scan line on the circle. 

15 Go through each pixel in the binary image. 

Place the centre of the mask on the current pixel. Count 
the number of BLACK pixels and the number of WHITE pixels 
in the circular region. 

20 

If (#WHITE pixels £ # BLACK pixels) then set corresponding 
output pixel to WHITE. Otherwise output pixel is BLACK. 
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3 . 3 Voxel carving 



The aim of this process is to produce a 3D voxel grid, 
5 enclosing the object, with each of the voxels marked as 
either object or empty space. 

The input to the algorithm is: 

10 a set of binary segmentation images, each of which 

is associated with a camera position and 
orientation; 

j 

2 sets of 3D co-ordinates, (xmin, ymin, zmin) and 
15 (xmax, ymax, zmax) , describing the opposite 

vertices of a cube surrounding the object; 

a parameter, n, giving the number of voxels 
required in the voxel grid. 



20 



A pre-processing step calculates a suitable size for the 
voxels (they are cubes) and the 3D locations of the 
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voxels, using n, (xmin, ymin, zmin) and (xmax, ymax, 
zmax) . 

Then, for each of the voxels in the grid, the mid-point 
5 of the voxel cube is projected into each of the 
segmentation images. If the projected point falls onto 
a pixel which is marked as background, on any of the 
images, then the corresponding voxel is marked as empty 
space, otherwise it is marked as belonging to the object. 

10 

Voxel carving is described further in "Rapid Octree 
Construction from Image Sequences" by R. Szeliski in 
CVGIP: Image Understanding, Volume 58, Number 1, July 
1993, pages 23-32. 

15 

3.4 Marching cubes 

The aim of the process is to produce a surface 
triangulation from a set of samples of an implicit 
2 0 function representing the surface (for instance a signed 
distance function) . In the case where the implicit 
function has been obtained from a voxel carve, the 
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implicit function takes the value -1 for samples which 
are inside the object and +1 for samples which are 
outside the object. 

5 Marching cubes is an algorithm that takes a set of 
samples of an implicit surface (e.g. a signed distance 
function) sampled at regular intervals on a voxel grid, 
and extracts a triangulated surface mesh. Lorensen and 
Cline i:Li and Bloomenthal iv give details on the algorithm 
10 and its implementation. 

The marching-cubes algorithm constructs a surface mesh by 
"marching" around the cubes while following the zero 
crossings of the implicit surface f(x)=0, adding to the 

15 triangulation as it goes. The signed distance allows the 
marching-cubes algorithm to interpolate the location of 
the surface with higher accuracy than the resolution of 
the volume grid. The marching cubes algorithm can be 
used as a continuation method (i.e. it finds an initial 

20 surface point and extends the surface from this point). 

3 . 5 Decimation 
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The aim of the process is to reduce the number of 
triangles in the model, making the model more compact and 
therefore easier to load and render in real time. 

5 The process reads in a triangular mesh and then randomly 
removes each vertex to see if the vertex contributes to 
the shape of the surface or not. (i.e. if the hole is 
filled, is the vertex a "long" way from the filled hole). 
Vertices which do not contribute to the shape are kept 
10 out of the triangulation. This results in fewer vertices 
(and hence triangles) in the final model. 

The algorithm is described below in pseudo-code. 
15 INPUT 



Read in vertices 



Read in triples of vertex IDs making up triangles 



PROCESSING 



20 



Repeat NVERTEX times 

Choose a random vertex, V, which hasn't been 



chosen be fore 



Locate set of all triangles having V as a 



vertex, S 



25 



Order S so adjacent triangles are next to each 
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other 

Re-triangulate triangle set, ignoring V (i.e. 
remove selected triangles & V and then fill in 
hole) 

Find the maximum distance between V and the 
plane of each triangle 
If (distance < threshold) 

Discard V and keep new triangulation 

Else 

Keep V and return to old triangulation 

OUTPUT 

Output list of kept vertices 
Output updated list of triangles 
15 

The process therefore combines adjacent triangles in the 
model produced by the marching cubes algorithm, if this 
can be done without introducing large errors into the 
model . 

20 

The selection of the vertices is carried out in a random 
order in order to avoid the effect of gradually eroding 
a large part of the surface by consecutively removing 
neighbouring vertices . 

25 



5 



10 



3 . 6 Further Surface Generation Techniques 
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Further techniques which may be employed to generate a 3D 
computer model of an object surface include voxel 
colouring, for example as described in "Photorealistic 
Scene Reconstruction by Voxel Coloring" by Seitz and Dyer 
5 in Proc. Conf. Computer Vision and Pattern Recognition 

1997 , pl067-1073, "Plenoptic Image Editing" by Seitz and 
Kutulakos in Proc. 6th International Conference on 
Computer Vision , pp 17-24 , "What Do N Photographs Tell Us 
About 3D Shape?" by Kutulakos and Seitz in University of 

10 Rochester Computer Sciences Technical Report 680, January 

1998, and "A Theory of Shape by Space Carving" by 
Kutulakos and Seitz in University of Rochester Computer 
Sciences Technical Report 692, May 1998. 

15 4. TEXTURING 

The aim of the process is to texture each surface polygon 
(typically a triangle) with the most appropriate image 
texture. The output of the process is a VRML model of 
20 the surface, complete with texture co-ordinates. 

The triangle having the largest projected area is a good 
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triangle to use for texturing, as it is the triangle for 
which the texture will appear at highest resolution. 



A good approximation to the triangle with the largest 
5 projected area, under the assumption that there is no 
substantial difference in scale between the different 
images, can be obtained in the following way. 

For each surface triangle, the image "i" is found such 
10 that the triangle is the most front facing (i.e. having 
the greatest value for n t .^i, where fi t is the triangle 
normal and v A is the viewing direction for the "i" th 
camera). The vertices of the projected triangle are then 
used as texture co-ordinates in the resulting VRML model. 

15 

This technique can fail where there is a substantial 
amount of self-occlusion, or several objects occluding 
each other. This is because the technique does not take 
into account the fact that the object may occlude the 
2 0 selected triangle. However, in practice this does not 
appear to be much of a problem. 
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It has been found that, if every image is used for 
texturing then this can result in very large VRML models 
being produced. These can be cumbersome to load and 
render in real time. Therefore, in practice, a subset of 
5 images is used to texture the model. This subset may be 
specified in a configuration file. 
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CLAIMS 

1. A method of operating an apparatus for generating 
model data representative of a model in a three 
dimensional space from image data representative of a set 
5 of camera images of an object; 

the apparatus performing the steps of; 

displaying a model image based on an existing set of 
model data; 

displaying one of the camera images of the object 
10 for selection by a user of an additional feature to be 
represented by additional model data; 

receiving an image point selection signal responsive 
to user actuation of an input means and identifying co- 
ordinates of an image point in the camera image defining 
15 the selected additional feature; 

calculating a locus in the three dimensional space 
defining positions of possible model points corresponding 
to the image point and consistent with the geometric 
relationship between the object and a camera position 
20 from which the displayed camera image was taken; 

displaying a position indicator in the model image 
at co-ordinates in the model image corresponding to one 
of the possible model points on the locus; 

receiving positioning signals responsive to user 
25 actuation of the input means and updating the co- 
ordinates of the position indicator such that movement of 
the position indicator is constrained to follow a 
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trajectory in the model image corresponding to the locus; 

receiving a model point selecting signal responsive 
to user actuation of the input means and determining 
selected co-ordinates of the position indicator to be the 
5 position indicator co-ordinates at the time of receiving 
the model point selecting signal; and 

determining co-ordinates of the additional model 
point in the three dimensional space corresponding to the 
selected co-ordinates of the position indicator. 

10 

2 . A method as claimed in claim 1 including displaying 
in the model image a line representing the locus. 

3. A method as claimed in any preceding claim wherein 
15 the locus is a straight line in the three dimensional 

space. 

4. A method as claimed in any preceding claim wherein 
the input means comprises a computer mouse and wherein 

20 said positioning signals are responsive to user actuation 
of the mouse by clicking the mouse during movement and 
releasing the mouse at a selected position to generate 
the model point selecting signal. 

25 5. A method as claimed in any preceding claim including 
the step of displaying a symbol representative of the 
additional model point at a model image point 
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corresponding to the selected co-ordinates. 

6. A method as claimed in claim 5 comprising the 
further step of editing the position of the additional 

5 model point in response to receiving further positioning 
signals and model point selecting signals responsive to 
user actuation of the input means. 

7 . A method as claimed in any preceding claim including 
10 the further step of receiving a processing instruction 

signal and, responsive to said signal, implementing a 
model generating process to incorporate the additional 
model point into the model data. 

15 8. A method as claimed in claim 7 including the step of 
generating surface elements of the model from the model 
data including the additional model point and displaying 
said surface elements in the model image. 

20 9. Apparatus for generating model data representative 
of a model in a three dimensional space from image data 
representative of a set of camera images of an object; 
the apparatus comprising; 

an interface comprising display means operable to 
25 display images to a user and input means responsive to 
user actuation; 

control means operable to control the display means 
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to display a model image based on an existing set of 
model data and to display one of the camera images of the 
object for selection by a user of an additional feature 
to be represented by additional model data; 
5 receiving means for receiving an image point 

selection signal responsive to user actuation of the 
input means and identifying co-ordinates of an image 
point in the camera image defining the selected 
additional feature ; 

10 calculating means for calculating a locus in the 

three dimensional space defining positions of possible 
model points corresponding to the image point and 
consistent with the geometric relationship between the 
object and a camera position from which the displayed 

15 camera image was taken; 

the control means being further operable to control 
the display means to display a position indicator in the 
model image at co-ordinates in the model image 
corresponding to one of the possible model points on the 

2 0 locus; 

the apparatus further comprising means for receiving 
positioning signals responsive to user actuation of the 
input means and updating the co-ordinates of the position 
indicator such that movement of the position indicator is 
2 5 constrained to follow a trajectory in the model image 
corresponding to the locus ; 

means for receiving a model point selecting signal 
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responsive to user actuation of the input means and 
determining selected co-ordinates of the position 
indicator to be the position indicator co-ordinates at 
the time of receiving the model point selecting signal; 
5 and 

means for determining co-ordinates of the additional 
model point in the three dimensional space corresponding 
to the selected co-ordinates of the position indicator. 

10 10. Apparatus as claimed in claim 9 wherein the control 
means is operable to control the display means to display 
in the model image a line representing the locus. 

11. Apparatus as claimed in any of claims 9 and 10 
15 wherein the calculating means is operable to calculate 

the locus as a straight line in the three dimensional 
space. 

12. Apparatus as claimed in any of claims 9 to 11 
20 wherein the input means comprises a computer mouse and 

wherein said positioning signals are responsive to user 
actuation of the mouse by clicking the mouse during 
movement and releasing the mouse at a selected position 
to generate the model point selecting signal. 

25 

13. Apparatus as claimed in any of claims 9 to 12 
wherein the control means is operable to control the 
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display means to display a symbol representative of the 
additional model point at a model image point 
corresponding to the selected co-ordinates. 

5 14. Apparatus as claimed in claim 13 comprising editing 
means for editing the position of the additional model 
point in response to receiving further positioning 
signals and model point selecting signals responsive to 
user actuation of the input means. 

10 

15. Apparatus as claimed in any of claims 9 to 14 
including model generating means operable to receive a 
processing instruction signal and, responsive to said 
signal, to implement a model generating process to 

15 incorporate the additional model point into the model 
data . 

16. Apparatus as claimed in claim 15 wherein the model 
generating means is operable to generate surface elements 

20 of the model from the model data including the additional 
model point and wherein the control means is operable to 
control the display to display said surface elements in 
the model image. 

25 17. A computer program comprising processor 
implementable instructions for carrying out a method as 
claimed in any of claims 1 to 8 . 
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18. A storage medium storing processor implementable 
instructions for controlling a processor to carry out a 
method as claimed in any of claims 1 to 8 . 

5 19. An electrical signal carrying processor 
implementable instructions for controlling a processor to 
carry out a method as claimed in any of claims 1 to 8. 

20. A method of operating an apparatus for generating 
10 model data defining a model in a three dimensional space, 
the model data comprising co-ordinates defining model 
points and surface elements generated with reference to 
the model points; the method comprising editing an 
existing set of model data by the steps of; 
15 adding a new model point to the existing set of 

model data; 

projecting the new model point onto the model and 
identifying a selected one of the surface elements onto 
which the new model point is projected; 
20 identifying a subset of the model points which 

define the generation of the selected surface element; 

adding the new model point to the subset to form an 
edited subset of model points; and 

generating one or more edited surface elements from 
2 5 the edited subset of model points to replace the selected 
surface element . 
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21. A method as claimed in claim 20 wherein the 
projecting step comprises receiving input data defining 
a centre of projection and projecting the new model point 
onto the model in a direction of projection along a ray 

5 generated through the centre of projection and the new 
model point. 

22. A method as claimed in claim 21 wherein the existing 
set of model data is generated by processing image data 

10 representative of camera images of an object to be 
modelled. 

23. A method as claimed in claim 22 wherein the step of 
receiving input data comprises receiving an image 

15 selection signal for selecting one of said camera images , 
and defining the centre of projection for projecting the 
model point to be co-ordinates representative of a camera 
position from which the selected camera image was taken. 

20 

24. A method as claimed in claim 23 including the step 
of displaying a set of camera images and receiving a 
selection signal responsive to user actuation of an input 
means to select the selected camera image. 

25 

25. A method as claimed in claim 22 wherein the step of 
receiving input data comprises receiving an image 
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selection signal for selecting one of said camera images, 
and receiving an image point selection signal defining 
co-ordinates of an image point in said selected camera 
image corresponding to the new model point, 

5 

26. A method as claimed in claim 25 including the step 
of calculating in the three dimensional space co- 
ordinates of the centre of projection to correspond to 
the position of the image point in an image plane of the 

10 camera. 

27. A method as claimed in any of claims 21 to 26 
including the step of determining whether a plurality of 
surface elements are intersected by the ray and, if so, 

15 determining the selected surface to be whichever of the 
intersected surface elements is closest to the new model 
point . 

28. A method as claimed in any of claims 20 to 27 
2 0 wherein the surface elements comprise triangular facets 

and wherein each subset of the model points defining the 
generation of the selected surface element comprises 
three model points constituting apices of the triangular 
facets . 

25 

29. Apparatus for generating model data defining a model 
in a three dimensional space, the model data comprising 
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co-ordinates defining model points and surface elements 
generated with reference to the model points , the 
apparatus being operable to edit an existing set of model 
data and comprising; 
5 means for adding a new model point to the existing 

set of model data; 

means for projecting the new model point onto the 
model and identifying a selected one of the surface 
elements onto which the new model point is projected; 
10 means for identifying a subset of the model points 

which define the generation of the selected surface 
element; 

means for adding the new model point to the subset 
to form an edited subset of model points; and 
15 means for generating one or more edited surface 

elements from the edited subset of model points to 
replace the selected surface element. 

30. Apparatus as claimed in claim 29 wherein the 
20 projecting means comprises receiving means for receiving 

input data defining a centre of projection, the 
projecting means being operable to project the new model 
point onto the model in a direction of projection along 
a ray generated through the centre of projection and the 
2 5 new model point. 

31. Apparatus as claimed in claim 30 wherein the 
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existing set of model data is' generated by processing 
image data representative of camera images of an object 
to be modelled. 

5 32. Apparatus as claimed in claim 31 wherein the 
receiving means is operable to receive an image selection 
signal for selecting one of said camera images , and to 
define the centre of projection for projecting the model 
point to be co-ordinates representative of a camera 
10 position from which the selected camera image was taken. 

33. Apparatus as claimed in claim 32 comprising 
interface means for displaying a set of camera images and 
receiving a selection signal responsive to user actuation 

15 of an input means to select the selected camera image. 

34. Apparatus as claimed in claim 31 wherein the 
receiving means is operable to receive an image selection 
signal for selecting one of said camera images , and to 

2 0 receive an image point selection signal defining co- 
ordinates of an image point in said selected camera image 
corresponding to the new model point. 

35. Apparatus as claimed in claim 34 including 
25 calculating means for calculating in the three 

dimensional space co-ordinates of the centre of 
projection to correspond to the position of the image 
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point in an image plane of the camera. 

36- Apparatus as claimed in any of claims 30 to 35 
including means for determining whether a plurality of 
5 surface elements are intersected by the ray and, if so, 
determining the selected surface to be whichever of the 
intersected surface elements is closest to the new model 
point . 

10 37. Apparatus as claimed in any of claims 30 to 36 
wherein the surface elements comprise triangular facets 
and wherein each subset of the model points defining the 
generation of the selected surface element comprises 
three model points constituting apices of the triangular 

15 facets. 

38. A computer program comprising processor 
implementable instructions for carrying out a method as 
claimed in any of claims 20 to 28. 

20 

39. A storage medium storing processor implementable 
instructions for controlling a processor to carry out a 
method as claimed in any of claims 20 to 28. 

25 40. An electrical signal carrying processor 
implementable instructions for controlling a processor to 
carry out a method as claimed in any of claims 2 0 to 28. 
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41. A method of operating an' apparatus for generating 
model data representative of a three dimensional model of 
an object from input signals representative of a set of 
camera images of the object taken from a plurality of 

5 camera positions, the method comprising; 

displaying a set of icons, each being associated 
with a respective one of the camera images of the object; 

receiving a selection signal responsive to user 
actuation of an input means whereby the selection signal 
10 identifies a selected one of the icons; 

determining a selected camera image from the set of 
camera images corresponding to the selected icon; 
displaying the selected image; 

determining position data representative of a 
15 selected camera position from which the selected image 
was taken; 

generating in accordance with said model a model 
image representative of a view of the model from a 
viewpoint corresponding to the position data; and 
20 displaying the model image for visual comparison 

with the selected image by the user. 

42. A method as claimed in claim 41 including the step 
of generating the icons in response to receiving a mode 

25 selection input. 

43. A method as claimed in any of claims 41 and 42 
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wherein the icons are generated as thumbnail images of 
the respective camera images. 

44. A method as claimed in claim 43 wherein the step of 
5 displaying the set of icons comprises displaying the 

icons in an array and displaying links between the icons 
such that each pair of icons corresponding to adjacent 
camera positions in a positional sequence of the camera 
positions is joined by a respective link. 

10 

45. A method as claimed in claim 44 wherein the icons 
are displayed in a linear array. 

46. A method as claimed in any of claims 41 to 45 
15 wherein the selected camera image and the model image are 

displayed in respective windows and including the step of 
providing relative movement of the windows in response to 
receiving window movement input signals. 

20 47. A method as claimed in claim 46 wherein the icons 
are displayed in a further window and including the step 
of facilitating movement of the further window relative 
to the image windows in response to window movement input 
signals . 

25 

48. A method as claimed in any of claims 41 to 47 
comprising generating the selection signal by operation 
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of a pointing means for user actuation in selecting one 
of the displayed icons. 

49. A method as claimed in claim 41 wherein displaying 
5 the set of icons comprises displaying a view of the model 

from a viewpoint in which the icons comprise 
representations of cameras and are shown at respective 
positions relative to the model which correspond 
substantially to the camera positions relative to the 
10 object. 

50. Apparatus for generating model data representative 
of a three dimensional model of an object from input 
signals representative of a set of camera images of the 

15 object taken from a plurality of camera positions, the 
appar atu s compri sing ; 

display means for displaying a set of icons, each 
being associated with a respective one of the camera 
images of the object; 
20 means for receiving a selection signal responsive to 

user actuation of an input means whereby the selection 
signal identifies a selected one of the icons; 

means for determining a selected camera image from 
the set of camera images corresponding to the selected 
25 icon whereby the display means is operable to display the 
selected image; 

means for determining position data representative 
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of a selected camera position' from which the selected 
image was taken; 

means for generating in accordance with said model 
a model image representative of a view of the model from 
5 a viewpoint corresponding to the position data; and 
control means for controlling the display means to 
display the model image for visual comparison with the 
selected image by the user. 

10 51. Apparatus as claimed in claim 50 further comprising 
means for generating the icons in response to receiving 
a mode selection input. 

52. Apparatus as claimed in any of claims 50 and 51 
15 wherein icon generating means is operable to generate the 

icons as thumbnail images of the respective camera 
images - 

53. Apparatus as claimed in claim 52 wherein the control 
2 0 means is operable to control the display means to display 

the set of icons in an array and to display links between 
the icons such that each pair of icons corresponding to 
adjacent camera positions in a positional sequence of the 
camera positions is joined by a respective link. 

25 

54. Apparatus as claimed in claim 53 wherein the control 
means is operable to control the display means to 
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display the icons in a linear array. 

55. Apparatus as claimed in any of claims 50 to 54 
wherein control means is operable control the display 
5 means to display the selected camera image and the model 
image in respective windows and to provide relative 
movement of the windows in response to receiving window 
movement input signals. 

10 56. Apparatus as claimed in claim 55 wherein the control 
means is operable to control the display means to display 
the icons in a further window to facilitate movement of 
the further window relative to the camera image window 
and model image window in response to window movement 

15 input signals. 

57. Apparatus as claimed in any of claims 50 to 56 
wherein the means for generating the selection signal 
comprises a pointing means for user actuation in 
2 0 selecting one of the displayed icons. 



58. Apparatus as claimed in claim 50 wherein the control 
means is operable to control the display means for 
displaying the set of icons by displaying a view of the 
2 5 model from a viewpoint in which the icons comprise 
representations of cameras and are shown at respective 
positions relative to the model which correspond 
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substantially to the camera positions relative to the 
object . 

59. A computer program comprising processor 
5 implementable instructions for carrying out a method as 

claimed in any of claims 41 to 49. 

60. A storage medium storing processor implementable 
instructions for controlling a processor to carry out a 

10 method as claimed in any of claims 41 to 49. 

61. An electrical signal carrying processor 
implementable instructions for controlling a processor to 
carry out a method as claimed in any of claims 41 to 49. 

15 

62. In a method of operating an apparatus for generating 
model data representative of a model in a three 
dimensional space from image data representative of a set 
of camera images of an object, an improvement comprising; 

20 the apparatus performing the steps of; 

displaying a model image based on an existing set of 
model data; 

displaying one of the camera images of the object 
for selection by a user of an additional feature to be 
2 5 represented by additional model data; 

receiving an image point selection signal responsive 
to user actuation of an input means and identifying co- 



84 



2636201 



ordinates of an image point in 'the camera image defining 
the selected additional feature; 

calculating a locus in the three dimensional space 
defining positions of possible model points corresponding 
5 to the image point and consistent with the geometric 
relationship between the object and a camera position 
from which the displayed camera image was taken; 

displaying a position indicator in the model image 
at co-ordinates in the model image corresponding to one 
10 of the possible model points on the locus; 

receiving positioning signals responsive to user 
actuation of the input means and updating the co- 
ordinates of the position indicator such that movement of 
the position indicator is constrained to follow a 
15 trajectory in the model image corresponding to the locus; 

receiving a model point selecting signal responsive 
to user actuation of the input means and determining 
selected co-ordinates of the position indicator to be the 
position indicator co-ordinates at the time of receiving 
20 the model point selecting signal; and 

determining co-ordinates of the additional model 
point in the three dimensional space corresponding to the 
selected co-ordinates of the position indicator. 

25 63. In an apparatus for generating model data 
representative of a model in a three dimensional space 
from image data representative of a set of camera images 
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of an object; 

an improvement wherein the apparatus comprises; 
an interface comprising display means operable to 
display images to a user and input means responsive to 
5 user actuation; 

control means operable to control the display means 
to display a model image based on an existing set of 
model data and to display one of the camera images of the 
object for selection by a user of an additional feature 
10 to be represented by additional model data; 

receiving means for receiving an image point 
selection signal responsive to user actuation of the 
input means and identifying co-ordinates of an image 
point in the camera image defining the selected 
15 additional feature; 

calculating means for calculating a locus in the 
three dimensional space defining positions of possible 
model points corresponding to the image point and 
consistent with the geometric relationship between the 
20 object and a camera position from which the displayed 
camera image was taken; 

the control means being further operable to control 
the display means to display a position indicator in the 
model image at co-ordinates in the model image 
25 corresponding to one of the possible model points on the 
locus; 

the apparatus further comprising means for receiving 
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positioning signals responsive to user actuation of the 
input means and updating the co-ordinates of the position 
indicator such that movement of the position indicator is 
constrained to follow a trajectory in the model image 
5 corresponding to the locus; 

means for receiving a model point selecting signal 
responsive to user actuation of the input means and 
determining selected co-ordinates of the position 
indicator to be the position indicator co-ordinates at 
10 the time of receiving the model point selecting signal; 
and 

means for determining co-ordinates of the additional 
model point in the three dimensional space corresponding 
to the selected co-ordinates of the position indicator. 

15 

64. In an apparatus for generating model data 
representative of a model in a three dimensional space 
from image data representative of a set of camera images 
of an object , a method wherein; 
2 0 the apparatus performs the steps of; 

displaying a model image based on an existing set of 
model data; 

displaying one of the camera images of the object 
for selection by a user of an additional feature to be 
2 5 represented by additional model data; 

receiving an image point selection signal responsive 
to user actuation of an input means and identifying co- 
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ordinates of an image point in' the camera image defining 
the selected additional feature; 

calculating a locus in the three dimensional space 
defining positions of possible model points corresponding 
5 to the image point and consistent with the geometric 
relationship between the object and a camera position 
from which the displayed camera image was taken; 

displaying a position indicator in the model image 
at co-ordinates in the model image corresponding to one 
10 of the possible model points on the locus; 

receiving positioning signals responsive to user 
actuation of the input means and updating the co- 
ordinates of the position indicator such that movement of 
the position indicator is constrained to follow a 
15 trajectory in the model image corresponding to the locus; 

receiving a model point selecting signal responsive 
to user actuation of the input means and determining 
selected co-ordinates of the position indicator to be the 
position indicator co-ordinates at the time of receiving 
20 the model point selecting signal; and 

determining co-ordinates of the additional model 
point in the three dimensional space corresponding to the 
selected co-ordinates of the position indicator. 

25 65. In a method of operating an apparatus for generating 
model data defining a model in a three dimensional space, 
the model data comprising co-ordinates defining model 
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points and surface elements generated with reference to 
the model points; an improvement wherein the method 
comprises editing an existing set of model data by the 
steps of; 

5 adding a new model point to the existing set of 

model data; 

projecting the new model point onto the model and 
identifying a selected one of the surface elements onto 
which the new model point is projected; 
10 identifying a subset of the model points which 

define the generation of the selected surface element; 

adding the new model point to the subset to form an 
edited subset of model points; and 

generating one or more edited surface elements from 
15 the edited subset of model points to replace the selected 
surface element. 

66. In an apparatus for generating model data defining 
a model in a three dimensional space , the model data 

20 comprising co-ordinates defining model points and surface 
elements generated with reference to the model points, an 
improvement wherein the apparatus is operable to edit an 
existing set of model data and comprises; 

means for adding a new model point to the existing 

2 5 set of model data; 

means for projecting the new model point onto the 
model and identifying a selected one of the surface 



89 



2636201 



elements onto which the new model point is projected; 

means for identifying a subset of the model points 
which define the generation of the selected surface 
element; 

5 means for adding the new model point to the subset 

to form an edited subset of model points; and 

means for generating one or more edited surface 
elements from the edited subset of model points to 
replace the selected surface element • 

10 

67. In an apparatus for generating model data defining 
a model in a three dimensional space, the model data 
comprising co-ordinates defining model points and surface 
elements generated with reference to the model points; a 
15 method comprising editing an existing set of model data 
by the steps of; 

adding a new model point to the existing set of 
model data; 

projecting the new model point onto the model and 
2 0 identifying a selected one of the surface elements onto 
which the new model point is projected; 

identifying a subset of the model points which 
define the generation of the selected surface element; 

adding the new model point to the subset to form an 
2 5 edited subset of model points; and 

generating one or more edited surface elements from 
the edited subset of model points to replace the selected 
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surface element. 

69. In a method of operating an apparatus for generating 
model data representative of a three dimensional model of 

5 an object from input signals representative of a set of 
camera images of the object taken from a plurality of 
camera positions , an improvement wherein the method 
comprises; 

displaying a set of icons, each being associated 
10 with a respective one of the camera images of the object; 

receiving a selection signal responsive to user 
actuation of an input means whereby the selection signal 
identifies a selected one of the icons; 

determining a selected camera image from the set of 
15 camera images corresponding to the selected icon; 
displaying the selected image; 

determining position data representative of a 
selected camera position from which the selected image 
was taken; 

20 generating in accordance with said model a model 

image representative of a view of the model from a 
viewpoint corresponding to the position data; and 

displaying the model image for visual comparison 
with the selected image by the user. 

25 

70. In an apparatus for generating model data 
representative of a three dimensional model of an object 
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from input signals representative of a set of camera 
images of the object taken from a plurality of camera 
positions , an improvement wherein the apparatus 
comprises; 

5 display means for displaying a set of icons , each 

being associated with a respective one of the camera 
images of the object; 

means for receiving a selection signal responsive to 
user actuation of an input means whereby the selection 
10 signal identifies a selected one of the icons; 

means for determining a selected camera image from 
the set of camera images corresponding to the selected 
icon whereby the display means is operable to display the 
selected image ; 

15 means for determining position data representative 

of a selected camera position from which the selected 
image was taken; 

means for generating in accordance with said model 
a model image representative of a view of the model from 

20 a viewpoint corresponding to the position data; and 
control means for controlling the display means to 
display the model image for visual comparison with the 
selected image by the user. 

25 71. In an apparatus for generating model data 
representative of a three dimensional model of an object 
from input signals representative of a set of camera 
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images of the object taken from a plurality of camera 
positions, a method comprising; 

displaying a set of icons, each being associated 
with a respective one of the camera images of the object; 
5 receiving a selection signal responsive to user 

actuation of an input means whereby the selection signal 
identifies a selected one of the icons; 

determining a selected camera image from the set of 
camera images corresponding to the selected icon; 
10 displaying the selected image; 

determining position data representative of a 
selected camera position from which the selected image 
was taken; 

generating in accordance with said model a model 
15 image representative of a view of the model from a 
viewpoint corresponding to the position data; and 

displaying the model image for visual comparison 
with the selected image by the user. 
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ABSTRACT 

METHOD AND APPARATUS FOR GENERATING 
MODEL DATA FROM CAMERA IMAGES 
A three-dimensional model of an object is generated 
from image data representing a set of camera images of 
the object. Refinement of the model is carried out on 
the basis of a comparison of a displayed model image and 
a displayed camera image, enabling a user to select co- 
ordinates in the camera image defining the selection of 
an additional feature which is to be added to the model. 
The model calculates a locus in the three-dimensional 
space defining positions of possible model points 
corresponding to the selected image point and consistent 
with the geometric relationship between the object and a 
camera position from which the displayed camera image was 
taken. A position indicator is displayed in the model 
image at co-ordinates on the locus and the position 
indicator is moveable by the user in a manner which is 
constrained to follow a trajectory in the model image 
corresponding to the locus. The user may then select a 
required position of the new model point on the locus and 
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the model data is updated to include the new model point 
corresponding to such selection. The surface of the 
model may be edited to incorporate a new model point by 
identifying a facet by projection onto the model and 
automatically re-facetting the model. Evaluation of the 
model image requires presenting corresponding camera and 
model images, the model viewpoint being determined 
automatically by user selection of camera frames or 
camera position. 
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Fig 2 

MODEL WINDOW SHOWS FEATURE MISSING FROM 
MODEL IMAGE WHEN COMPARED WITH CAMERA IMAGE 
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Fig 3 IN CAMERA IMAGE WINDOW ,USER SELECTS NEW 

IMAGE POINT FOR ADDITION TO MODEL 
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Fig 4 

MODEL CALCULATES LOCUS OF NEW MODEL POINT 



Fig 5 




IN MODEL WINDOW,MODEL IMAGE IS DISPLAYED FROM A 
DIFFERENT VIEWING ANGLE AND USER MOVES NEW MODEL 
POINT TO REQUIRED POSITION .CONSTRAINED BY 
CALCULATED LOCUS 
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Fig 6 
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IN MODEL WINDOW, USER SELECTS EXISTING MODEL 
POINTS FOR CONNECTION TO NEW POINT 

108 




Fig 7 



IN MODEL WINDOW, MODEL IMAGE NOW SHOWS NEW 
MODEL POINT AND FACETS 
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Fig 8 



MODEL IMAGE WITH ADDED MODEL DATA , VIEWED FROM 
^ SAME DIRECTION AS CAMERA IMAGE 
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FIG. 9 
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USER VIEWS IMAGES 
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DISPLAY MODEL IMAGE AND 
CAMERA IMAGE 
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USER SELECTS AN IMAGE 

POINT IN THE CAMERA 
IMAGE USING A POINTING 
MEANS 



RECEIVE IMAGE POINT 
SELECTION SIGNAL AND 
CALCULATE CO-ORDINATES 
IN CAMERA IMAGE 
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CALCULATE LOCUS OF 3-D 
POINT CORRESPONDING TO 
CAMERA IMAGE CO- 
ORDINATES 



USER VIEWS MODEL 
IMAGE AND POSITION 
INDICATOR 
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DISPLAY IN MODEL IMAGE 
WINDOW A POSITION 
INDICATOR AT CO- 
ORDINATES IN THE MODEL 
IMAGE CORRESPONDING TO 
A POINT ON THE LOCUS 



USER ACTUATES 
POINTING MEANS TO 
INTERACTIVELY MOVE THE 
POSITION INDICATOR 
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RECEIVE INPUT OF POINTING 
SIGNALS AND MOVE 

LOCATION OF POSITION 
INDICATOR IN MODEL IMAGE 

WINDOW IN REAL TIME 
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USER SELECTS BY 
ACTUATING SELECTOR 
SWITCH THE REQUIRED 
LOCATION OF POSITION 
INDICATOR TO SET 
POSITION OF NEW MODEL 
POINT 



RECEIVE SELECTION INPUT 
SIGNAL AND FREEZE 
POSITION OF POSITION 
INDICATOR IN MODEL IMAGE 
WINDOW 
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USER VIEWS MODEL AND 
NEW MODEL POINT AND 
SELECTS POINTS FOR 
SURFACE GENERATION 
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EXISTING MODEL DATA 
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DISPLAY NEW MODEL POINT 
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Fig 10 
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Fig 11 



DISPLAY OF LINE INDICATING LOCUS/ 
108 — TRAJECTORY 
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Fig 12 

ADDING A NEW MODEL POINT 
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Fig 13A 



CAMERA SELECTION WINDOW WITH CAMERA ICONS 




Fig 13B 

CAMERA SELECTION WINDOW WITH CAMERA IMAGES 
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Fig 14 



CALCULATE FACET INTERSECTED BY RAY THROUGH 
CAMERA POSITION AND ADDED MODEL POINT 
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Fig 16 

DISPLAY NEW MODEL INCLUDING ADDED POINT AND NEW FACETS 
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FIG. 18 



USER 



GENERATING NEW FACETS 
INTERFACE 



NO 



181 



18; 



USER 
SELECTS 
FACET TO 
BE 

REPLACED 



DISPLAY 
PROMPT TO 
USER TO 
SELECT A 
MODEL FACET 



RECEIVE USER 
INPUT OF 

SELECTION OF 
A FACET 



183— > 



DETERMINE 
SET OF 
CO-ORDINATES 
ON WHICH 
SELECTED 
FACET IS 
BASED 



APPARATUS 




SELECT INTERSECTED FACET 
CLOSEST TO CAMERA POSITION 

AND DETERMINE SET OF CO- 
ORDINATES ON WHICH 
INTERSECTED FACET IS BASED 



184^ 



XT" 



185 



ADD NEW MODEL POINT TO SET 
OF CO-ORDINATES 



185-^ 

RETRIANGULATE MODEL 
FACETS BASED ON NEW SET OF 
CO-ORDINATES 



T 



i HIS PAGE BLANK (uspto) 



16/23 



Fig 19 



IDENTIFY COORDINATES IN CAMERA IMAGE OF 
FEATURE CORRESPONDING TO ADDED MODEL POINT 
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Fig 20 



CALCULATE FACET INTERSECTED BY RAY THROUGH 
CAMERA IMAGE POINT AND ADDED MODEL POINT 
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